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Foreword 


This document exposes a few fundamental proofs about Turing machines and Wang tiles. They 
complement the lectures given during the “Contemporary Mathematics” summer school, in Dubna 
(Russia) between 19th and 30th July, 2018. We would like to thank the organizers for giving us the 
opportunity to speak there, as well as for making the whole event pleasant and interesting. 


These notes were written for readers without deep knowledge about any specific area of mathe- 
matics. The prerequisites are: basic operations on integers (both in N and in Z), decomposition of 
an integer over a base (like 13 = 1101 in base 2), the concept of sequence and the concept of function. 
We will occasionally use a few more advanced things, but you can always skip them and continue to 
read: you should be able to understand anyway. 

On the other hand, just because you need little knowledge to start studying tilings, doesn't mean 
that the subject is easy. Some proofs presented (or outlined) here took years to develop. To make 
things worse, many of the original research papers (and even later books) on the subject omit a 
lot of details and let the readers do the work. In these notes, we tried to make the exposition as 
precise as possible: consequently, the result is sometimes a bit cumbersome to read. Please do not 
be afraid by the symbols or the definitions: the underlying ideas are often very simple. Take your 
time, read things slowly, and skip parts that seem too confusing. You do not need to understand all 
the technical details in order to extract interesting ideas from these notes. 


There are lectures, divided into sections, divided into paragraphs. Thus a section number looks 
like 2.4 (Lecture 2, Section 4) and a paragraph number looks like 2.4.3 (Lecture 2, Section 4, Para- 
graph 3). When we want to refer to another paragraph, we write something like VERI 

Numbers between brackets like iB refer to the bibliography at the end of the notes. Numbers 
between parentheses like (2.1) refer to an equation (Chapter 2, Equation 1). 


If you have any question (or comment for improving these notes), please do not hesitate to 
contact us. Our email addresses appear on the first page of this document. You can use either 
English, Russian or French. An up-to-date version of these notes should be available at 


/www.mccme.ru/dubna/2018/courses/gamard-pchelina.html and, otherwise, please contact one 


of us. 


Thanks for reading and attending! 
Daria and Guilhem 


Lecture 1 


A mathematical definition of computation 


1.1 Introduction 


1.1.1. The purpose of this lecture is to show how we can talk mathematically about computers and 
programming. Our first step is to give a definition of a computer program. Since we want to build 
a mathematical theory, we won't rely on the specific details of any programming language; on the 
other hand, we would like to use the intuition that we have when writing code, so we will try to 
capture what underlies most programming languages available today. 


1.1.2. There are two very important concepts found in (almost) all programming languages: source 
code and the memory. 

The memory is just a huge array of integers, while the source code is a sequence of instructions 
that tells how to change the memory. These instructions sometimes depend on a condition, like 


if x < 10, then r+ zv — 10; else zx +1 
or should be repeated several times in a loop, like 
while r1, do x + 2/2. 


Therefore, the best representation of source code is a graph, like the one on Figure (don't pay 
attention to the symbols in the nodes for now). Using graphs, instead of text, for the source code 
frees us from the syntactic details of the language: we do not need to care whether a loop is written 
while ... endor loop ... end: a loop is just a cycle in the graph. 

Memory is less simple than it seems at first. Indeed, each memory cell can only contain finitely 
many different values: for instance, the integers between 0 and 2?? — 1 (included). If memory cells 
were able to contain abritrarily large integers, the resulting computer would be much more powerful 
than what we have today. Therefore this constraint is important, and we'll need to take it into 
account in our mathematical model. This means that we'll have to encode all the objects on which 
we would like to compute (integers, real numbers, functions, graphs, etc.) into sequences of integers 
bounded by 2°? — 1. 


1.1.3. Programs have inputs and outputs. Those concepts do not have any precise definition: the 
input of a program may be a file stored on the disk, something typed by the user on a keyboard, 
or even something pointed on a screen by a mouse. Similarly, the output may be written on a disk, 
displayed on a screen, sent to a printer, sent to another computer over the Internet, etc. 

Our mathematical model will not deal with all these complicated kinds of input and output. 
We will assume that the input of our program has been loaded in memory somehow; we'll perform 
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computations, and then write the output back in the memory. The details about how this input 
arrived in memory and what the output becomes after we write it are not important to us. 


1.1.4. Once we have a mathematical definition of a program, the first thing to do is to check that 
it is a good definition. Ideally, we would like that our mathematical “programs” are “equivalent” to 
some real programming language, such as Python. If we do this, then all the things that we prove 
mathematically apply at least to the Python programming language, so they have some practical 
consequences. This is easier said than done, though, because the Python programming language 
itself has no precise mathematical definition. We will see how to overcome these problems. 

Finally, we can start to prove interesting things about programs. We will only show two easy, 
but very important results here: the Universality theorem, which says that what is true for one 
programming language is (to some extent) true for all of them; and the Halting theorem, which says 
that some computations cannot be done by a computer program. 


1.2 Definition of Turing machines 


1.2.1. Let I denote the set (0,...,2% — 1). Theoretically we could take any finite set for I (with 
at least 2 elements), but the integers modulo 2% is a popular choice for electronic computers these 
days. 

A memory is an array of unbounded length, where each cell contains an element of I. (Formally, 
a memory is a function from N to I.) Having arrays of unbounded length may seem unrealistic, but 
we will never use infinitely many cells at the same time. Thanks to this definition, the memory will 
always be big enough to run our (theoretical) programs, so we will never have to worry about running 


out of memory. 


1.2.2. A head is a device which moves on a memory and is able to read and write numbers in cells, 
one cell at a time. There are four possible operations: L, R, W; and T; (where i is an integer in I): 


e L means “move the head by one cell to the left”; 

e R means “move the head by one cell to the right”; 

e W; means “write the integer ¿ in the cell under the head”; 

* T; means “test if the cell under the head is equal to i” (more on this later). 


In W; and in 7;, the integer i is part of the operation. Intuitively, ¿ is a constant “hardcoded in the 
source code" of a program. Figure illustrates the memory and the head. 


1 1 2 3 5 8 12 21 34 0 


Head 


Figure 1.1: The memory (filled with random numbers) and the head. 


Note that in electronic computers, memory does not quite work like that. Indeed, these computers 


usually have less than 2% memory cells, so they can use elements of I to tell in which memory cell 


they want to read or write. However, our theoretical model allows to take any finite set for I, even 
one with two elements, and also requires that the memory is unbounded. Considering a head which 
moves one cell at a time solves these problems: we do not use Il at all to talk about the memory. As 
we will see in the next section, this simplification does not change what is possible or not: computers 
with electronic memory and computers with heads can do exactly the same computations. 


1.2.3. A Turing machine (or a program) is a graph with labeled nodes, such that one node is labeled 
start, one node is labeled end, and each other node is labeled with an operation (as defined in 
91.2.2). Moreover the edges satisfy the following restrictions: 


e if a node is labeled start, L, R, or W;, then it has only one outgoing edge; 
e if a node is labeled 7;, then it has exactly two outgoing edges (labeled true and false); 
e if a node is labeled end, then it has no outgoing edge. 


Note that several different vertices might be labeled with the same operation. 


Intuitively, each node is an operation and an edge going out of a node points to the next operation 
to perform. The operations L, R and W; only have one possible outcome, so there is an unique 
outgoing edge. By contrast, T; is a test, so it has two possible outcomes: either the condition is true, 
or it is false. The edge true is followed is the first case, and the edge false is followed in the second 
case. 

Figure shows an example of a Turing machine. 


false 


Figure 1.2: Example of a program (that doesn't do anything interesting). 


1.2.4. A configuration is a tuple (P, v, M, h) where: 


+ P is a Turing machine (see 91.2.3); 


e v is a node of P; 


* M is a memory (a function N > I); 
e his a nonnegative integer (i.e. h € N). 


Intuitively, P is the program that we are running, v is the next instruction we're about to execute, 
M is the contents of the memory, and is the current position of the head in the memory. In short, 
a configuration is the state in which the computer is at a given point of time. 


If (P, v, M, h) is a configuration, then the successor configuration is the configuration we get after 
executing the instruction in v. Let O denote the label of v. If v has only one outgoing edge, then let 
v' denote the target of that edge; if v has two outgoing edges, then let v' and v" denote respective the 
targets of these two edges. The successor configuration of (P, v, M, h) is formally defined as follows: 
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e if O = start, then the successor is (P, v’, M, Rh); 
e if O = L, then the successor is (P, v', M, max(h — 1,0)); 
e if O = R, then the successor is (P, v', M, h + 1); 


e if O = Wi, then the successor is (P, v’, M', h), where M’ is defined as M'(h) = i and M'(n) = 
M (n) for n A h; 


» if O =T; and M(h) =i, then the successor is (P, v’, M’, h); 
if O = T; and M(h) Z i, then the successor is (P, v", M', h); 


e if O = end, then there is no successor. 


We write succ(c) for the successor of a configuration c. 


1.2.5. Let co denote a configuration and for each positive integer n, let c, +1 = succ(c,), if it exists. 
The sequence co, c1, C2, ... is called a computation. We write c, = (P, Un, Mn, hn) for each n where 
Cn is defined. 

If we have label(v,) = end for some integer n, then the computation is finite, because cn has no 
successor configuration. In this case we say that the computation terminates in n steps. Moreover, 
we'll say that My is the input of the computation and that M,, is the outputp. 

On the other hand, if the computation never reaches a configuration c, with label(v,) = end, 
then the sequence co, c1, C2,... is infinite. We say that the computation does not terminate on input 
Mp. 


If we have a Turing machine P and a memory Mo, well write T(Mo) for the output of the 
compuation starting with cy = (P, start, Mo, 0), if that computation terminates. Otherwise, P(Mo) 
is not defined. We will also say things like the output of P on Mo is... to talk about P(Mp). 


1.2.6. A computation may be represented as a space-time diagram, such as the one on Figure El 
A diagram consists of an infinite grid, where line number n represents the nth configuration of the 
computation. Each cell of the grid represents one cell of the memory, and the head is drawn inside 
the cell on which it is. 


1.3 Turing machines compute functions on integers 


1.3.1. Now we have formal definitions of a program, a computation, an input and an output. However, 


our inputs and outputs only talk about sequences of elements of I. In many cases, it will be much 
more convenient to talk about natural (nonnegative) integers; we mean all the integers, not just up 
to 2% — 1. We now show how to encode natural integers (i.e., any element of N) into memory. 
Since each memory cell contains an element of I = {0,..., 2°? — 1}, it is natural to write integers 
in base 2%, so that each integer is a sequence of cells. However, the memory of a Turing machine is 


infinite, while the decomposition of an integer in a base (whatever the base) is always finite. We need 
to fill the rest of the memory with something; in computer science, it is common to fill unused memory 
with zeroes. Therefore, we could write that the representation of an integer is the decomposition of 
this integer in base 2%, followed by infinitely many zeroes. 

Here we run into a big problem: we have no way to detect when the representation of an integer is 
finished! Consider 1000000000000000001 (written in base 2%), for instance. We cannot tell whether 
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Figure 1.3: A space-time diagram of a Turing machine. 
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a long sequence of zeroes is part of an integer, or if it is just the infinite filling that comes after the 
representation. We need a way to mark the end of an integer in memory. 


1.3.2. The easiest solution is to write integers in base 2?!, and to use the digit 2?! itself (which never 
appears in decompositions in base 2?!) as a marker of ending. For convenience, we will write the 
symbol # for the digit 2?!. Moreover, we will write integers with the lowest-significant digit first, 
which is the reverse of the usual convention. If n is an integer and ng... Nino is its decomposition in 
base 2?!, then the memory containing just n looks like this: 


no | Ty | Ng |... Nk # 0101010 
If n is an integer, we note 7 its representation. 


1.3.3. For any integer n € N, there is a Turing machine that, starting from a memory full of zeroes, 
writes n on the memory. This machine is displayed on Figure 1.4. Note that, at the end of the 
execution, the head is positioned after # 


C46) 49 (8)-49—-— 9-48) 


Figure 1.4: Turing machine writing the integer n = nj ...n4ng in the memory. 


1.3.4. Let T' denote a Turing machine and f a function defined on a subset of N. We say that P 
computes f if and only if: 


e f is defined on an integer n if and only if P terminates on input m; 


e fín) = P(N) for each n where f is defined. 


We say that a function f is computable if and only if there exists a program P such that P 
computes f. If f only returns booleans (0 or 1), we sometimes say that f is decidable instead of 
computable. The following question is fundamental in computer science: 


Question. Is there a function f, defined over a subset of the integers, which is not computable? 


We will see the answer later in this lecture. If you are impatient, you can try a counting argument. 
(How many functions are there? How many Turing machines are there?) 


1.3.5. The composition of two computable functions is computable. 


1.4 Turing machines can do arithmetic 


Exercise 1.4.1. Let A denote the set {+, —, x,—, «, 2). We suppose that the operators < and 
= return the integer 1 instead of True, and the integer 0 instead of False. Let m,n denote two 
arbitrary integers. For each + in A, design a Turing machine that, on input m#n#: 

terminates and outputs 4#, where / = m T n. Thus, the memory looks like this after the execution: 
£o | à |» | & | HE} 0] 0) 0 
and the head is positioned after the #. 


mo Mil. | me | #] no] na]. 


1.4.2. Using the Turing machines from 81.3.1 and Exercise 1.4.1 we can translate any arithmetic 
expression that contains integer constants and operators from A. Given an expression, we define P 
as follows: 


e Pli], where i is an integer constant, is just the machine defined in 81.3.1; 


e Plei f e2] is defined as: 


start|— P[ei]| >| Ple] | 2| P[1]| >| ena 


where P[1] is a Turing machine from Exercise "m 


Let us look at an example. The expression 2 + 3 x 7 — 4 will be translated as (mind the priority of 
x over + and —): 


start | | P[2] >| PB] | P[7] 5 P[x]| 3| P[+]|— | P4] | P[=]|= Lend 


To understand how this works, let us observe the contents of the memory after each box is executed 
(we do not write the infinite part of the memory full of zeroes at the end): 


after start: nothing (only zeroes) 
after P2: 24 

after P3: 2434 

after P[7]: 243474 

after P[x]: 24214 

after P[+]: 234 

after P[4]: 23444 

after P[-]: 19# 


As you can see, we can visualize the memory as a stack of integers, where the top of the stack is 
on the right. A block P [i], for an integer i, pushes the integer i on the top of the stack. A block PT[j], 
for an operation +, pops two integers from the stack, computes the result of the operation on these 
two numbers, and pushes the result back on the stack. (We should remember to move the head to 
the left before each operation.) 

It turns out that any arithmetic expression can be evaluated on this stack form. The Forth 
programming language, for example, is based on stacks 


Exercise 1.4.3 (Optional). Write a Python class that models an arithmetic expression, i.e., a binary 
tree where internal nodes have labels in .A and leaves have labels in N. Using the class Node from 
81.5.4 below, write a translator from arithmetic expressions to Turing machines. 


1.5 A Turing machine interpreter in Python 


We saw that Turing machines are able to perform the usual arithmetic operations over the integers. 
However, this is not enough to prove that this model is as powerful as usual programming languages, 
such as Python. So, the purpose of this section is to prove the following result. 


Theorem 1.5.1. Jf a function is computable by a Turing machine, then there exists a Python program 
which computes the same function. 


1.5.2. To be fully formal, we would need a mathematical definition of Python. However, we saw in 
Section that it was already hard to define Turing machines, and Python is orders of magnitude 
more complicated than that! Fortunately we can still give convincing arguments which use our 
intuitive understanding of Python. 

By the way, the ideas of this section apply as well to almost any programming language. We 
chose Python because it is rather popular, but any language (C++, Common Lisp, Forth, Erlang, 
Smalltalk..) will do. Some programming languages even have mathematically precise definitions, 
most notably Scheme If and Haskell n so all the arguments of this section could be turned to a 
fully formal proof for Scheme or Haskell. Unfortunately, this would require a huge amount of work, 
and we have better things to do for today. 


The easiest way to prove Theorem is to write a Turing machine interpreter in Python. In 
other words, we want to write a function in Python that takes a Turing machine P and an integer n 
as an input and that returns P(n), if it exists. Let us call this function compute. How does it help to 
prove the theorem? Suppose that we have compute in Python. Then, for any computable function 
f, there exists a Turing machine P which computes f; so we can just write a Python program that 
calls compute on that Turing machine (we can hardcode the machine in our program). 


Exercise 1.5.3. Write a Turing machine interpreter (the function compute) in Python. 


The rest of this section gives the solution to that exercise. You may skip it if you are confident 
that Turing machine can indeed be run by Python programs. If not, please try to write a Python 
interpreter by yourself before reading on! 


1.5.4. Let us start to write some code. The next listing shows the definition of a class that handles 
Turing machines. It follows closely 91.2.3, 


class Node(): 
def init (self, label, next-None, i-None): 
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HA 
def 
def 
def 
def 
def 
def 


# Label can only be one of these: 
if not label in ["L", "R", "W", "T", "start", "end"]: raise(ValueError) 
# We need an integer i for Wi and T à 
if label in ["W", "T"]: 
if not (type(i) is int): raise(ValueError) 
# But we don't want i for any other label 
else: 
if not (i is None): raise(ValueError) 
# Finally, we can initialize our node 
self.label = label 
self.i = i 
self.next = next # This represents nect if true for T à 
self.next if false = None # Only used for T à 


# The next 4 lines allow to write v.nect if true for v.nect 
Cproperty 

def next_if true(self): return self.next 
Onext if true.setter 

def next if true(self,value): self.next - value 


few convenient functions to build nodes 
start(next-None): return Node("start", next) 
end(): return Node("end") 

L(next-None): return Node("L", next) 
R(next-None): return Node("R", next) 

W(i, next-None): return Node("W", next, i) 
T(i, next if true-None, next if false-None): 
v = Node("T", next if true, i) 
v.next if false - next if false 

return v 


1.5.5. The next listing shows the definition of a class that models configurations and that can 
compute successors, following 81.2.4. The only difference between the code and the formal definition 
is that the Configuration class doesn't keep a pointer to the Turing machine P, but only a pointer 
to the current node v. Besides that, the code is a straightforward translation of the math. 


class Configuration(): 


def init (self, node, memory-[0], head-0): 
if head >= len(memory): raise(ValueError) 
if not type(node) is Node: raise(TypeError) 


self.node = node 
self.memory - memory 
self.head = head 


def successor(self): 
if self.node.label -- "start": 
self.node = self.node.next 
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elif self.node.label == "L": 
self.head = max(self .head-1, 0) 
self.node = self .node.next 


elif self.node.label == "R": 
self.head = self.head+1 
# Since the memory is infinite, we extend it as needed 
if len(self.memory) == self.head: self .memory. append (0) 
self.node = self.node.next 


elif self.node.label == "W": 
self .memory[self.head] = self.node.i 
self .node = self .node.next 


elif self.node.label == "T": 
if self.memory[self.head] <= self .node.i: 
self.node = self.node.next_if true 
else: 
self.node = self.node.next_if false 


elif self.node.label == "end": pass # Nothing to do! 


return self 


1.5.6. Finally, the next listing shows three functions. The_first two are functions to convert an 
arbitrary integer to its memory representation (defined in gi 3.1) and back. The third function is 
compute: it takes a Turing machine P and an integer n as parameters, and computes P(n). Note 
that if P enters an infinite loop, then the compute function will also loop! 


def integer to representation(n, base-2**31): 
result - [] 
while True: 
q,r - n//base, n/base 
result .append(r) 
if q == 0: 
result.append(base) # The end marker is 2**31 
return result 
else: 


n-q 


def representation to integer(representation, base-2**31): 
result = 0 
n= 1 
for i in representation: 
if i >= base: break # Stop if we found the end marker 
result += i * n 
n *- base 


12 


return result 


def compute(program, n): 
# We suppose that ^program' points to the "start' node 
if not program.label -- "start": raise(ValueError) 


M 
c = Configuration(program, M, 0) 
while c.node.label != "end": 

c = c.successor() 


Il 


integer_to_representation(n) 


return representation_to_integer(c.memory) 


1.5.7. Now, it is not hard to translate any Turing Machine into a Python program. The next listing 
shows how to write the Turing machine from Figure in Python. The idea is to first create all 
the nodes using the functions start, L, R, W, T and end. Then, for each node v, set all the edges by 
changing v.next (or v.next if true and v.next if false if it’s a T node). Once the graph is 
created, simply call compute on it and on the desired integer n to compute P(N). 


def machine of figi(n): 
## Nodes 
tm = [ 
# Left branch 
start(), 
W(1), 
RO, 
TU, 
T(3), 
# Upper branch 
RO, 
TCS), 
# Lower branch 
W(2), 
RO, 
WO), 
# End 
endO , 


HH Vertices 

# Left branch 
tm[0] .next = tm[1] 
tm[1].next = tm[2] 
tm[2].next - tm[3] 
tm[3].next if true, tm[3].next if false - tm[4], tm[1] 
tm[4].next if true, tm[4].next if false = tm[5], tm[7] 
# Upper branch 

tm[5].next = tm[6] 

tm[6].next if true, tm[6].next if false 


M 


Il 


Il 


tm[2], tm[10] 
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# Lower branch 

tm[7].next = tm[8] 
tm[8].next = tm[9] 
tm[9].next = tm[10] 


## Finally, we can just call compute 
return compute(tm, n) 


The whole code given above makes an implementation of the Turing machine from Figure 
in Python. This code can trivially be adapted to any other Turing machine. By our remarks from 
91.5.2, this code is enough to prove Theorem 1.5.1. 


1.6 A Turing machine that interprets Python 


Still because we want to argue that Turing machines are a good model of programming languages, 
we would like to prove the converse of Theorem 1.5.1. 


Theorem 1.6.1. For each Python function f defined over a subset of N, there exists a Turing 
Machine P that computes f. 


The remarks in also apply here: almost any programming language could be used instead 
of Python, and ideally we would need a mathematical definition of Python (or whatever language we 
use). Ultimately, though, our main task is to translate an arbitrary Python program into a Turing 
machine. Since this is a very difficult task, we will split it into two steps, and we will skip the details 
of the first step. 


Claim 1.6.2. Let Simple Python denote the restriction of the Python programming language that 
only allows integers and arrays, arithmetic (+, —, x, +, = and <), if/else statements, and while 
loops. Any Python program can be translated in Simple Python. Moreover, the translator itself 
can be written in Simple Python. 


Indeed, all features of normal Python that are not in Simple Python (such as strings, dictionaries, 
objects, for loops, exceptions, etc.) can ultimately be translated into arrays of integers. The Python 
interpreter actually does this in order to execute Python code, but we will not explain the details. If 
you are interested, you can refer to a recent textbook about compilers. 

It might be hard to believe that the Python Simple Python translator can be written in Simple 
Python. However, it seems perfectly reasonnable that such a translator may be written in normal 
Python. For instance, Pypy ll is a Python interpreter which is written itself in Python; it does 
a translation to Simple Python as a part of its interpretation process. So, suppose that we have a 
translator written in normal Python. Then, we can run the translator on its own source code in 
order to get a translator written in Simple Python. 


Exercise 1.6.3 (Optional). Rewrite the Turing machine interpreter of Section in Simple Python. 


1.6.4. Our task is now to write a (mathematical) function T' that translates any Simple Python 
function to a Turing machine that computes the same function. We already have Turing machines 
for arithmetic expressions: this was 91.4.2] We still need to handle variables, arrays, if/else 
instructions and while loops. 


Exercise 1.6.5. (a) For each integer i in N, design a Turing machine that, on input: 
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VO jt Ul ++ soe Uk-1 F H 01010 


writes a copy of v; after the ##. Note that vo, v1,...,Ug—1 are blocks of several cells, not just single 
cells. 


Hint: you need to temporarily change the original v; in order to write a copy, but you can restore 
it later. 


(b) Conversely, for each integer i in N, design a Turing machine that, on input: 


Uo + vı Fl... Uk—1 # | # Uk # (9/010 


erases v; and writes a copy of vz instead. Note that v; might be shorter or longer than v;! Besides, 
we might have ¿ > k; in this case, cells containing just 0 should be inserted to expand the array up 
to the right size. 


(Instead of writing this Turing machine by hand, you might want to write a program that produces this 
Turing machine. You can use the Python code developped in the previous section as a starting point.) 


1.6.6. A 2-memory Turing machine a Turing machine with two memories. In other terms, in a 
2-memory configuration, each memory cell contains a couple of numbers instead of a single number. 
Call (i,j) the contents of the memory cell just under the head. Instructions start, end, L and R 
work the same as with usual Turing machines. There are two instructions for testing: T and T". 
The first one tests against 2, while the second one tests against j. Similarly, there are two writing 
instructions: W and W’; the first rewrites i while the second rewrites j. 


Exercise 1.6.7. Prove that a function f is computable by a Turing machine if and only if it is 
computable by a 2-memory Turing machine. 


1.6.8. Thanks to Exercise 1.6.7] we can use two memories instead of one. Therefore, we can use one 
memory to evaluate arithmetic expressions (as in the previous section) and one to store the values 
of variables, using the same schema as in Exercise Arrays are handled similarly, since they are 
just variables that occupy more than one cell of memory. 


Exercise 1.6.9. Suppose we have two Turing machines Ti, To, and 73 for Python programs P4, Pa, 
and P; respectively. Design Turing machines that do: 


. if(P,): P, else: Pi 
. while(P,): P, 


1.6.10. We have implemented all the features of Simple Python with Turing machines, therefore, 
Theorem is proved. 


1.7 The Universality theorem 
Here is a direct consequence of Theorems and 1.6.1] 


Proposition 1.7.1. Turing machines are equivalent to Python, in the sense that a function defined 
over a subset of N is computable by a Turing machine if and only if it is computable in Python. 


It helps to establish the following result. 
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Theorem 1.7.2 (The Universality Theorem). There is a sequence of Turing machines (Pa)nen and 
a Turing machine U such that: 


e all Turing machines appear in (P,,): for each machine Q, there is an integer i such that Q = P;; 
e if U is run on input mn 4 for two integers m and n, then it computes P.(n). 


The second point means that if Pm terminates on input 7 with output 7, then U terminates on input 
m#n# with output T. Conversely, if P,, does not terminate on input 7, then U does not terminate 
on input Mm # TN 32. 


Proof. By Theorem m any Turing machine can be translated to a Python program. So for the 
sequence (P,) just set P, to be the Python program represented by the integer n (viewed as a 
sequence of bits). Then, the machine U is just a Python interpreter. There exists a Turing machine 
that interprets Python, because there exists a Python program that interpret Python (Pypy Bh. and 
because any Python program can be translated into a Turing machine by Theorem 


1.7.3. Theorem [L.7.2 says that, if we have a program written somewhere in the memory, then it is 
possible to run that program as if it were part of the source code. Or, in other terms: the source code 
may be in memory instead of being fixed in advance. This allows to do metaprogramming: programs 
that write programs (and then, run them). 


1.8 The Halting theorem 


1.8.1. For this section, we start with a practical question called the Halting problem: 

Given a Turing machine T and an input x, does T terminates on input x? 
Thanks to Proposition 1.7.1} it is equivalent to ask this question about Turing machines and about 
Python programs. Thus the task is as follows: we are given (the source code of) a Python function 
P and a number z, and we need to compute whether P(x) is going to meet an infinite loop, or it will 
return something. Since infinite loop are usually not a desirable feature, it is interesting to detect 
them. 


Theorem 1.8.2. The halting problem is undecidable. 


By this we mean that no Turing machine H (or no Python program) exist that takes another 
Turing machine P (or another Python program) and a memory M as an argument, such that 
H(P, M) = 1 if P(M) terminates and H(P, M) — 0 if P(M) does not terminate. In other terms, it 
is not possible to algorithmically test whether a given program loops or not. 


Proof. Suppose a function halt() exists in Python somehow, and consider the following program: 


# Suppose that halt(P,x) returns 0 if P(x) loops and 1 otherwise 
def problem(s): 
if halt(s, s): 
while True: pass 
else: 
return 0 


# What does this print? 
print (problem(problem)) 
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If this program terminates, then by definition this means that ‘problem(problem)‘ has an infinite 
loop. But if ‘problem(problem)‘ has an infinite loop, then the last line of the code also has an infinite 
loop, and this program does not terminate: we have a contradiction. 

Conversely, if this program does not terminate, then this means that ‘problem(problem)‘ termi- 
nates. But if ‘problem(problem)‘ terminates, then the last line will terminate, so the whole program 
will terminate for sure. We have again a contradiction. 

5o, in any case, supposing that halt() exists in Python gives a contradiction. By Proposi- 
tion mall this is equivalent to say that the corresponding Turing machine cannot exist either (it 
would also yield a contradiction). 


An important consequence is that computers are not all-powerful: there is at least one problem 
(namely, the halting problem) that they cannot solve. Besides, the Halting theorem is also useful to 
prove that other problems are undecidable, thanks to a technique called reduction. 


Lemma 1.8.3 (Reduction). Let U and (P,) be as in Theorem [1.7.4 Call h(n, x) the function such 
that h(n, x) = 1 if P,(x) terminates, and h(n) = 0 otherwise. If f is a function and a and algorithm 
such that f(a(n,x)) = h(n,x) for each n and each x, then f is uncomputable. 


Proof. Indeed, if f were computable, then the function n, x +> f(a(n, x)) would be computable (the 
composition of two computable functions is computable). But then h would be computable, which 
is a contradiction with Theorem 1.8.2. 


This lemma just says that, if we want to prove that a function f is uncomputable, all we have to 
do is to find an algorithm a such that f(a(x,n)) = h(x,n) for all x and all n. We will see examples 
of application of this lemma in the exercises, and in next lectures. 
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Computations: from Turing Machines to Tilings Dubna, 22nd July 2018 


l.e Exercises for Lecture 1 


Here are all the exercises from the lecture notes, reorganized and renumbered to make an exercise 
sheet. There are also a few new questions, to keep things interesting. Solve only what you like. 


Exercise 1.e.1. In this exercise, we deal with numbers that might be greater than 2?! (but still 
nonnegative). Recall that integers are represented as sequences no, n1,..., Ny, 4, where # = 2?! and 
T = Nng... Nino in base 2?!. Differently said: the first digit to appear is the lowest-significant one. 


(a) Design a Turing machine that computes the function n — n 4 1. 


(b) Let z = 2?! — 1; draw the space-time diagram of the Turing machine you designed for (a) on the 
input z, 2, 2, 4r. 


(c) Design a Turing machine that computes the function m, n — m +n. 


(d) Design a Turing machine that computes the function m, n > 0 if m = n and 1 otherwise. 
520 Design a Turing machine that computes the function m, n — 0 if m < n and 1 otherwise. 
2n Design a Turing machine that compute multiplication of integers. 


Ye Design Turing machines that compute substraction and division of integers. Hint: reuse your 
addition/multiplication machines, and use a brute-force algorithm. 


Exercise 1.e.2. 


(a) Write a Turing machine interpreter in Python (or any language you like). 


(b) Improve your interpreter so that it shows the space-time diagram of the computation that it is 
running. 


Exercise 1.e.3. Design a Turing machine that takes a sequence of integers between 0 and 2?! — 1 
(included), terminated by #, and sorts that sequence in nondecreasing order. 


Exercise 1.e.4. 


(a) Prove that a function f is computable by a Turing machine if and only if it is computable by a 
2-memory Turing machine (cf. 8[1.6.6). 


(b) Write the definition of a k-memory Turing machine, and prove that k-memory and /-memory 
Turing machines are equivalent (in the sense of the previous question) for all k,g > 1. 


(c) Prove that the set of computable functions is the same for any finite set I with at least two 
elements. In other terms, whether I = {0,..., 2°? — 1} or I=(0,1,...,9) or I = {0,1} does not 
change which functions are computable. 


La Prove that a Turing machine where the memory is bi-infinite, i.e., a function Z — I instead of 
N > I, can compute the same functions as a normal Turing machine. 


18 


Exercise 1.e.5. Find, on the internet: 


(a) An undecidable problem other than the Halting problem. Try to understand its proof. 


(b) A compiler from any programming language to Turing machines. Try it on simple programs and 
look at the resulting machines. 


520 Try to read a bit of the source code of the compiler you found. 


Exercise 1.e.6 (S. Write a Python class that models an arithmetic expression, i.e., a binary tree 
where internal nodes are labeled by arithmetic operations (+, —, x, +, =, <) and leaves have labeled 
by integers. Then, write a translator from arithmetic expressions to Turing machines. 


Exercise 1.e.7 (S. Feel free to change Il in order to have more additional symbols: for instance, 
you can set I = (2% — 1}. 


(a) For each integer ¿ in N, design a Turing machine that, on input: 


Vo + Ul # € Uk-—1 H H 0/01] 0 


writes a copy of v; after the ##. Note that vo,v1,..., vy 1 are blocks of several cells, not just 
single cells. Hint: you need to temporarily change the original v; in order to write a copy, but 
you can restore it later. 


(b) Conversely, for each integer i in N, design a Turing machine that, on input: 


Vo Ü Ui LM ERE Uk-1 # | # Uk #|9|0|0 


erases v; and writes a copy of v; instead. Note that vj, might be shorter or longer than v;! Besides, 
we might have i > k; in this case, cells containing just 0 should be inserted to expand the array 
up to the right size. 


Exercise 1.e.8 (S. Suppose we have three Turing machines T1, T2, and 73 for Python programs 
P4, Pa, and P3 respectively. Design Turing machines that do: 


. if(P,): Pj else: Pj 


. while(P,): P, 


Exercise 1.e.9 (P22), Write a Simple Python — Turing machine compiler. (The definition of 
Simple Python is in Claim 1.6.2). Reuse the constructions you designed in the previous exercises! 


Contacts 


e Daria Pchelina (dpchelina@clipper.ens.fr) 
* Guilhem Gamard (guilhem.gamard@normale.fr) 
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Lecture 2 


Wang tiles and the Domino Problem 


2.1 Introduction 


In the previous lecture, we learned about the notion of computability. A problem is computable if 
and only if there exists an algorithm solving it; thus, it can possibly be done by a computer. On the 
other hand, we saw that some problems were not computable. As mathematicians, one thing that we 
would like to do is to make our computers prove theorems for us, automatically. Hence the natural 
question: is theorem-proving computable? Let us look at an example. 


2.1.1. Suppose that we have a logic formula like: 


Vx € N, y EN, Yz EN, A(z, y, z) 


where @ is a formula without quantifiers (i.e., without V and 3) and without free variables (i.e., all 
variables are either x, y or z). However, ¢ might contain function names, predicate names, and 
constant names. One important question is: does this formula have a model? In other terms, can we 
find an actual function for each function name appearing in ¢, an actual predicate (i.e. a function 
returning a boolean) for each predicate name appearing in ¢, and an actual integer for each constant 
name appearing in $, so that the whole formula VxdyVzó(z, y, z) is true? 


For instance, if we want to find a model for the formula: 


Vx EN, ly € N, Yz € N, G(y, x) A[G(y, 2) => (-D(z, y) V G(t,z))] (2.1) 


then we need to find two predicates G and D, as well as a constant t, which make this formula true. 
Here is an example of such a model: 


e G(m,n) is “m > n^; 
e D(m,n) is “mis a divisor of n^; 
e tis the constant 2. 


With these interpretations for G, D, and t, the formula means “there are infinitely many prime 
numbers”, which is obviously true. (The formula reads: for each integer x, there is an integer y > x 
such that, for each integer z « y, either z is not a divisor of y or z « 2.) 

Be careful, though: the formula (b.1) does not necessarily mean “there are infinitely many primes". 
It might have another, very different model where G, D, and t are completely unrelated to prime 
numbers. In other words, we just gave one possible interpretation (i.e., one model) of the formula 
that makes it true, and this interpretation means that there are infinitely many primes (which is 
true). Other interpretations making the formula true (models) are possible. Of course, there also 
are interpretations that make the formula false, but they are usually not very interesting to explore. 
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2.2 Tilings (at last!) 


2.2.1. In 1961, Hao Wang tried to write a program that automatically checks whether an arbitrary 
yENTENYEN formula has a model. He found a way to translate the problem of finding a model to a 
much simpler combinatorial problem, which he describes as follows: 


Assume we are given a finite set of square plates of the same size with edges colored, 
each in a different manner. Suppose further there are infinitely many copies of each plate 
(plate type). We are not permitted to rotate or reflect a plate. The question is to find an 
effective procedure by which we can decide, for each given finite set of plates, whether we 
can cover up the whole plane (or, equivalently, an infinite quadrant thereof) with copies 
of the plates subject to the restriction that adjoining edges must have the same color. 


— Wang, 1961 [9] 


In other words, a tile is a square of unit size with colored edges, like this: X A set of tiles is 
called a tileset; each tile in a tileset can be copied infinitely many times. Now the question is: given 
a tileset A, can we cover the whole plane with copies of tiles from A, such that two adjascent borders 
always have the same color? Such a covering is called a valid tiling (or just a tiling). Figure 
shows a tileset and a piece of a tiling of the plane. 


Figure 2.1: A tileset and a (part of) a valid tiling by this tileset. 


It changes nothing if we consider that the plane is equipped with a grid made of unit squares, 
and that our task is to fill each square with a copy of a tile (respecting the color rule). Thus, each 
cell of the grid is uniquely determined by the coordinates of its bottom left-hand corner. A valid 
tiling is, formally, a function from Z? to A, which satisfies the previously stated color constraints. 
Therefore, if 7 is a tiling, we will write 7(0,0) for the tile which lies in the cell of coordinates (0, 0). 
More generally, if D is a subset Z?, a valid tiling of D (by the tileset A) is a function from D to A 
which satisfies the color constraints. 


Notation. Ifa is a tile, then we denote by north(a), west(a), south(a) and east(a) respectively the 
color on top, on the left, on bottom or on the right of a. 

Capital letters A, B, C... usually denote tilesets, while small letters a, b, c usually denote tiles. 
Capital letters T, U, V... denote tilings. 


2.2.2. Some tilesets have valid tilings of the plane; for instance, the set x IX) trivially has 
tiling (see Figure b.3). Conversely, some tilesets do not have any valid tiling; for instance P | K 
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because there is red on the right of the first tile, but no tile has red on the left side; and there is 
green on the bottom of the second tile, but no tile has green on the top side. If a tileset has a valid 
tiling of the plane, then we say that this tileset is solvable. Conversely, a tileset without any tiling 
of the plane is called unsolvable. 


Figure 2.2: A simple tiling. 


Question 2.2.3 (The Domino Problem). Given a tileset A, is A solvable? 


As mentioned above, Wang could translate the problem of finding a model for any VENIENYEN 
formula into the Domino problem. More precisely, Wang wrote a program that takes a formula $ as 
input and returns a tileset A as output, such that o has a model if and only if A has a tiling of the 
plane. Since this algorithm is very complicated, we will not give the details here. 

In order to do automatic model-finding, the only remaining step is to find an algorithm for the 
domino problem. First, the next theorem is very useful when we try to tile the plane. 


Theorem 2.2.4 (Compactness). If a tileset A can tile (in a valid way) a square of any finite size, 


then it can tile the whole plane. 


Exercise. Prove the Compactness theorem. (Solution below.) 


Proof of Theorem (Compactness). Let T'(n) denote, for each integer n, a valid tiling of an nx n 
square by A. Consider (r,),ew a sequence of elements of Z? that covers Z?, for instance a spiral 
(see Figure 2.3). Since there are finitely many tiles in A and infinitely many T'(n)'s, the pigeonhole 
principle implies that there are infintely many T'(n)'s that agree on the tile at position zo. In other 
terms, there is a tile ag and an infinite subsequence of T(n), let us call it T(ao(n)), such that 


Yn EN, T(ag(n))(xo) = ao. 


Now there are infinitely many tilings in in T(ao(n)) and finitely many tiles in A. Thus there exists 
a tile a; and an infinite subsequence of T'(ag(n)), let us call it T'(o(n)), such that 


Yn EN, T(oi(n))(xo)-— ao and  T(oi(n))(zi) = ai. 


By recurrence, for each integer k, there exists an infinite subsequence of T(n), call it T(a,(n)), such 
that: 
Vi € {0,...,k}, Vn EN, TY(ox(n))(zi) = ai. 
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Now define U to as follows: for each n in N, let U (£n) = T (o,,(0)) (£E. Since each point in Z? is 
somewhere in the z,,'s, U is a function Z? > A; but it might not be a valid tiling, because there might 
be two touching square borders with different colors. Suppose this is the case, for instance that there 
exist positions (7,7) and (i + 1, j) such that east(U (i, 7)) 4 west(U (i + 1, j)). By construction of U, 
there are infinitely many tilings in T (n) such that T(n)(i, j) = U(i, j) and T(n)(i4-1, 5) = U(i-- 1, j). 
More precisely, if (i, j) = x, and (i + 1,j) = zy, then set m = max(k, k’) and any tiling in T(a,(n)) 
has this property (for any n). But all the tilings in T(n) are valid, so all the touching borders have 
the same color: a contradiction. As a conclusion, U is a valid tiling of the plane. 


X2 


Lo Tı 


Figure 2.3: A sequence of elements of Z? that covers Z?. 


2.3 Wang’s algorithm 


2.3.1. A tiling T is called periodic if there exists a nonzero vector Y in Z? such that, for any x in Z?, 
we have T(x) = T(x +0). In other terms, T is invariant under translation by v. In this case, we call 
U a period of the tiling. We insist that the zero vector (0,0) is never a period. 

Moreover, if Y and v are two periodicity vectors of T and k, k’ two nonzero integers, then ki'+k’v’ 
is also a periodicity vector of T. 


Exercise 2.3.2 (Optional). Let T denote a solvable tileset. Prove that if T have a valid tiling with 
one period ¥, then it has a tiling with two periods v’ and v" which are not colinear (proportional). 


2.3.3. A tiling of a finite square is called repeatable if it is valid and if four copies of this square 
arranged in a 2 x 2 fashion (like on Figure are also a valid tiling. More formally, consider a tileset 
A, a discrete square (0,..., n — 1)? (this is a cartesian product), and a tiling T : (0,...,n -12 > A 
of this square; then T is repeatable if and only if, for all i € (0,...,n — 1}, 


e the bottom side of T(0,1) has the same color as the top side of T(n — 1,7); and 


e the right side of T(i,n — 1) has the same color of the left side of T(i, 0). 


lFormally we need some form of axiom of choice to justify that U actually exists, for instance Koning’s lemma. 
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Figure 2.4: A repeatable square. 


Lemma 2.3.4. Let A denote a solvable tileset. There exists a periodic tiling by A if and only if there 
exists a repeatable square for A. 


Proof. If a repeatable square exists, we can immediately build a periodic tiling by just repeating the 
square over and over all the plane. 

Conversely, suppose that a periodic tiling T' exists. Then, by Exercise b.3.9) then there exists a 
tiling 7" with two periodicity vectors which are not colinear. Call them vo; = (21, y1) and U3 = (12, y»). 
Observe that x20] — 110 is a vertical vector, while yov; — y,02 is an horizontal vector. Since T” has 
both a horizontal periodicity vector (112,0) and a vertical periodicity vector (0, y1y2), it contains a 
repeatable square of size z123 X yiy». 


Theorem and Lemma together suggest the following algorithm to solve the domino prob- 


lem. 
Algorithm 2.3.5 (Wang, 1961 ipl). Input: a tileset A. Output: whether A is solvable or not. 
Ld] 
2. Try all possible tilings of a square of size n x n by A; 
3. If there is no valid tiling, return false; 
4. Else, if there is a repeatable square among valid tilings, return true; 
5. Else, do n + n + 1 and go back to Step bl 
The proof of correctness of this algorithm relies on the following idea. 


Conjecture 2.3.6 (Wang, 1961 io]. A tileset is solvable if and only if it has a periodic tiling. 
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Wang’s conjecture implies that the algorithm is correct. Indeed, if the given tileset A is solvable, 
then by Conjecture and Lemma , A has a repeatable square. Since the algorithm tries all 
possible squares in search for a repeatable square, it will necessarily find it and conclude that A is 
solvable. On the other hand, when A is unsolvable, the algorithm returns false. If it did not, then 
it would mean that any square (of arbitrary size) is by tilable by A, so Theorem 2.2.4 would imply 
that A is solvable: a contradiction. 


It only remains to prove Wang’s conjecture. Unfortunately, it turns out that 
Theorem 2.3.7 (Berger, 1964 2). The Domino problem is undecidable. 


which means that there is no algorithm that solves the Domino problem. This was shown in 1966 
by a student of Wang, Robert Berger [2]. The proofs works by encoding Turing machines in tilesets 
and translating the Domino problem to the Halting Problem (Section El which is undecidable. 


2.3.8. Since the Domino problem is undecidable, Algorithm must be incorrect. However, 
we are sure that if this algorithm returns true, then the given tileset was solvable (because the 
algorithm found a repeatable square); conversely, if the algorithm returns false, then the given 
tileset was unsolvable (because the algorithm found a size of square n which cannot be tiled). The 
only remaining possibility for this algorithm to be incorrect, is that it does not return anything at 
all. In other terms, there are some tilesets A on which the algorithm loops infinitely, without ever 
returning. 


2.3.9. Consider a tileset A that makes Algorithm loop. From the remarks above, A solvable, 
but it has no periodic tiling of the plane. A solvable tileset this property is called an aperiodic tileset. 
Conjecture stated that there is no aperiodic tileset, and therefore 


Conjecture is false. 


Berger provided such an aperiodic tileset in his work, as a part of his translation from the Domino 
problem to the Halting problem. Since then, many aperiodic tilesets were discovered, with fewer and 
fewer tiles. Understanding aperiodic tilesets is still an active area of research today WJ. 


2.4 Encoding Turing machines in tiles 


Before proving Theorem ER! which is quite difficult, we prove a simpler result. Namely, we consider 
the Domino Problem with fixed origin tile: we are given a pair (A, a) where A is a tileset and a an 
element of A, and we are asked whether there exists a tiling of the plane T by A such that T (0,0) = a. 
In other terms, we have the same problem as Domino, except that one specific tile have been placed 
in advance at the origin. We prove the following theorem. 


Theorem 2.4.1 (Wang et al, 1963 i). The Domino problem with fixed origin is undecidable. 


The plan of the proof is as follows: we start with a Turing machine P and an input x for P, and 
we design a tileset T such that each tiling (with a specific tile at the origin) by T must “draw” a 
space-time diagram of P(x). If P(x) does not terminate, then its space-time diagram is infinite, so 
T can tile infinitely many lines and thus it is solvable (with fixed origin). On the other hand, if P(x) 
terminates, then its space-time diagram is finite, so T' can tile only finitely many lines and thus it is 
not solvable (with fixed origin). 

Moreover, the translation of P and z into a tileset T will be an algorithm. Let us call 
this algorithm f, so we have f(P, x) = T. Now, suppose we have an algorithm d to solve the Domino 
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problem with fixed origin. Then by the remarks above, for any Turing machine P and any input 
x, we have d(f(P,x)) = 1 if P(x) terminates, and 0 otherwise. In other terms, d(f(P, x)) is an 
algorithm for the Halting problem; such an algorithm does not exist, so we have a contradiction. 
Consequently, there is no algorithm d that solves the Domino problem with fixed origin. 


2.4.2. For the remainder of this section, fix a Turing machine P and an input x for P. 


2.4.3. If Q is any Turing machine, write Q (34) for the computation of # on the input full of symbols 
#. Instead of making a tileset for the computation P(x), it will be more convenient to make a tileset 
for the computation P’(#), for some other machine P’. Let x = x(0)...z(n) and define P, to be 
the machine that first writes r on the memory, then moves the head back at the beginning, and runs 
P: 


(48949483 -E 


« P 


Lemma 2.4.4. The computation P(x) terminates if and only if P,(#) does. If these computa- 
tions terminate, they both have the same output. Moreover, the transformation of (P, x) into P, is 
computable. 


A slightly more general version of this lemma is known as the sẹ? theorem and was initially proved 
by Stephen Kleene. It is a fundamental result of computability theory. For this lecture, however, we 
will stick with Lemma 2 


2.4.5. Now it remains to build a tileset T such that, any tiling by T (with a specific tile at the 
origin) somehow “draws” the space-time diagram of P,(#). (We will make the meaning of “draw” 
precise later, in 82.4.10 and Lemma b.4.11)) To make this whole construction more readable, we will 
build several tilesets, (T1), (T2), (T3), etc., each slightly more complicated than the previous one. 
The last one in this sequence will be the desired T. We will not explicitly give Python code for that 
algorithm, but our explanations should make it easy to write it. 


2.4.6. For (ri), the set of colors is C = {H,L,L',R,R’} (their meanings will be explained soon) 
and the set of tiles is: 


{H, R, L} DIET {L,L'} (H,R,RY (R,RY 


R £ L£ CO O R R R R! (T1) 


H L L R R' 


The notation {H, R, £} for a color means that we make three copies of the tile: one with the color 
H, one with R, one with £. Thus this tileset has 13 tiles in total. The meaning of the colors is as 
follows: 
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e H means: the head is here; 

e L means: the head is immediately on my left; 

e // means: the head is on my left, but not just next to me; 
e R means: the head is immediately on my right; 

e R' means: the head is on my right, but not just next to me. 


Figure shows a line tiled by (ri). What (r1) does is to guarantee that there is no more than 
one head on each line. There are three kinds of tiles: the head, tiles on the left of the head, and tiles 
on the right of the head. Besides, the two tiles that touch the head are different from all the others; 
in a sense, they know that they touch the head. This extra information will become handy later; it 
is the reason why we have to distinguish between R and R’ (also between £ and £’). 


* * * * * * * * * * * 
RR R) R) RR LOL WD wu E (e 
mR RR CR RR FF LLE X^ Ll E 


Figure 2.5: A line tiled by (ri). The symbol * means “any color”. 


2.4.7. For (r3), the set of colors is C x I, a cartesian product, where C is the set of colors of (C1) 
and I is the set of values that can appear in a memory cell of P (the Turing machine we are trying 
to encode into tiles). Then, for each t in (r1), each x in I and each y in I, make the following tile: 


= y) (—, y) 
(—, x) (—, x) if south(t) = H; (—, x) a otherwise (T2) 


(H, x) (— y) 


where “—” means “the same color as in the tile t”. We now have 13 x | 


I? tiles, where [I| is the 


number of elements of I. 

This new tileset (r2) inherits the features of (T1): on each line, at most one tile is the “head”, 
and all other tiles “know” if they are to the left or to the right of the head. Moreover, the two 
tiles that touch the head “know” that they touch the head. But (112) does something more: each 
tile now has an arbitrary element of I on its bottom color, and this element is propagated as-is on 


its top color. The only exception is the head tile: it can have any element of I on its top side, no 
matter what its bottom element is. Finally, the tiles also propagate one element of I on their left 
and right sides; because of the color constraints we designed, this element is always the one found 
on the bottom side of the head. 


Figure b.d shows a line tiled by (r2). 


Lo Tı T2 T3 T4 * T6 XT Tg tg Tio 


T5 Z5 s| X5| Xs) H |5 |5 X5 X5 Ts 


Lo Tı T2 T3 T4 T5 L6 TT Tg Tg X10 


Figure 2.6: A line tiled by (r2). The x; and * are elements of I. The first components of the colors 
(H, R, L, etc.) are not shown. 


2.4.8. For (r3), the set of colors is C x I x P, where C is the set of colors of (T1), is the set of 
values that might appear in the memory cells of P, and finally P itself is the set of nodes of the 


2r 


Turing machine we're encoding into tiles. For each tile ¢ in (r2) and for each node v in P, add the 
following tile into (r3): 


es (03 v) = 03 v) (T3) 

E x4 v) 
where, once again, the “—” refer to the corresponding colors in the tile t from (12). If the label of 
v is anything except 7;, then there is only one edge going out from v, and we call v' the target of 


this edge. If the label of v is T;, however, there are two edges going out from v. Fortunately, each 
tile “knows” what is the contents of the memory cell under the head, because this information is 


propagated in the second component of the left and right sides of each tile of ED. Thus we have 
all the required information to uniquely determine v’. 

Tiling a line with (r3) is pretty similar to tiling a line with (72), except that the whole line 
“knows” one node in the graph of P, and guarantees that the next line has the next node in the 
execution of the program. 


2.4.9. Now comes the interesting part. We build (T4) from (r3) by removing some tiles. 


First step. Remove all tiles matching the following pattern: 


(+, y, v) do aan 
yt if v has label W,, 
where v € T, x € I and 
(R, x, v) (£, x, v) f Ax otherwise. 
(H, x, v) 


Indeed, remember that, in (r3) (and thus in (13), the head is free to rewrite its memory cell with 
whatever value (up to the tiler's choice). Removing those tiles guarantees that the head does not 
change the value of its cell, unless the label of the current node is W;; in this case, it guarantees that 
the newly written value is 7. 


Second step. For each v other than L and R, remove all the tiles matching the following pattern: 


(x, x, v) (x, x, v) whenever x Æ y. 


(x, x, v) 


This guarantees that, if the label of v is not R nor L, then the head doesn’t move. 
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Third step. For v = R, then remove all tiles that match one of the following patterns: 


(x, *, +) (x, *, x) 
(R, x, v) (L, x, v) fort ER; (p sv) (L', *, v) for « Æ H; 
(71, *, v) (L, x, v) 
(x, *, x) 
(R^, x, v) (R, *, v) for z ZW 
(R, *, v) 


These removals guarantee that: 
e the tile above H will have R on the next line (so the head becomes “the head is to my right”); 
e the tile above £ will have H on the next line (so it becomes the new head); 
e the tile above R will have R’ on the next line (so it is further from the head) on the next line. 


The construction of (13) then guarantees that the rest of the next line is going to be consistent. 
Figure shows two lines tiled by (T4), supposing v = R; we only show the parts of the colors 
related to the position of the head. 


R R RR RIR|IHI|ILCIIPZIZC LE 
R Ri RI R RR LL E E E 
R! R! R! R! R H L E £^ £^ £^ 


Figure 2.7: Two consecutive lines tiled by (74) for v = R. Only the H/R/L£ part of the colors is 
shown. 


Fourth step. Finally for v — 1B, remove the tiles matching the following pattern: 


(£, x, *) oo, 
(R, *, v) (C, x, d) for x sb (R', *, v) Ru) fora ZH: 
(H, x, v) (R, x, v) 
(£, x, *) 
(L, *, v) FEN x, v) for x # L’ 
(£, *, v) 


These removals guarantee that: 


e the tile above H will have £ on the next line (so the head becomes “the head is to my left”); 


?Experts will note that this has nothing to do with the V = L axiom of set theory 
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e the tile above R will have H on the next line (so it becomes the new head); 


e the tile above £ will have £’ on the next line (so it is further from the head) on the next line. 


The construction of (r3) then guarantees that the rest of the next line is going to be consistent. 
Figure 2.8 shows two lines tiled by (T4), supposing v = L; we only show the parts of the colors 
related to the position of the head. 


RIR LEGE TL LJ COJ COJ COUI ESE 
R) R) RI R| RR LOL E E LL if 
R! R! R R R H L £! £t £^ £t 


Figure 2.8: Two consecutive lines tiled by (T4) for v = L. Only the H/R/L£ part of the colors is 
shown. 


2.4.10. Let c = (P,v, M, h) denote a configuration and £ a line tiled by (74). We write £, for the 
ntb tile in £ and south(£,) = (Pn, Mn, Un). Then we say that £ encodes c if and only if: 


e v = t, for each integer n; 

e M(n) = m, for each integer m; 
Lemma 2.4.11. If a line £ tiled by (T4) has at least one cell with H on its south color, then £ 
encodes a configuration c. 

Moreover, if l,l are two lines tiled by (T4), both representing configurations, such that l can be 


put on top of £ to form a valid tiling, then the configuration encoded by l is the successor of the 
configuration encoded by £. 


Proof. The first statement is immediate: the position of the head is the position of the (unique) cell 
having H on its south color; the current node v is the node appearing on all south colors; and the 
contents of the memory are also read on the south colors. 

The second statement is true by construction of (T4), so it results from the remarks made in 


Gad dad ada bad 


Exercise 2.4.12. Why is the condition “has at least one cell with H on its south color” needed in 
this lemma? 


Exercise 2.4.13. Let us modify the definition of a Turing machine and of a configuration (SIL.2.4) 
so that the memory is biinfinite, i.e., a function Z — I instead of N — I. Prove that a function 
is computable with an N-memory-Turing machine if and only if it is computable with a Z-memory- 
Turing machine. Moreover, show that the transformation of an N-memory- Turing machine to the 
"equivalent" Z-memory- Turing machine is computable. 


2.4.14. Now we make (T5) by removing tiles from (74). Let s denote the vertex labeled start in 
P. Then, make T' by removing each tile in (74) that matches the following pattern: 


HE zm for x se # 


(*,x,s) 
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Theorem 2.4.15. The computation P,(#) (and therefore P(x)) does not terminate if and only if 
there exists a tiling T of the upper half of the plane (Z x N) such that: 


(H, Hs") 
T(0,0) = (R,4,s) (L, 4,5) 
(1, s.s) 


where s is the node of P labeled with start, and s' is its successor (the next node to execute). 


Proof. The definition of (T5) is the subset of (T4) such that the input given to the machine consists 
only of # symbols. By Lemma 2.4.11| the placement of the tile in (0,0) guarantees that the line 0 of 
any tiling by T' encodes a configuration. By Lemma again, the line number n 4- 1 encodes the 
successor of the configuration represented by line n. By definition of a computation (81.2.4), there 
are infinitely many lines if and only if the machine does not terminate. 


2.4.16. We still have one problem to overcome before claiming a proof of Theorem "Zu Namely, 
Theorem only deals with tilings of half of the plane, but we need tilings of the full plane. The 
Compactness theorem ( does not help here, because of the fixed tile at the origin. 

Fortunately, there is an easy solution: make a tileset (7'5’), where each tile is just the vertical 
symmetric of a tile of (75). Moreover, in each tile of (T5'), replace each color (t, v, s) with the color 
(t, v, s), where v is a copy of v (but different from v). The only exception is that 5 = s, where s is 
the node of P labeled with start. See Figure P.9| for an illustration of this transformation. 


(t4, x", v") (t1, 2,0) 
{ (ta, y, v) (t3, y, v) M — (to, y, v) (ta, y, v) 


(ti, £, v) (ta, 0") 


Figure 2.9: Transformation of a tile of (T5) into a tile of (T5'). We have v = v if v is labeled with 
start, and v Z v otherwise. 


By the design of (15^), it is impossible to put one tile of (T5) next to a tile of (T5') (since the third 
component of tiles are disjoint between (T5) and (T5')) except when the two tiles have the start 
node. Moreover, (T5) makes the computation moving up, while (T5) moves down. Consequently, 
any tiling by (76) = (T5) U (T5') must follow the schema of Figure bu 


Lemma 2.4.17. The tileset (T5) can tile N x Z (with the origin constraint of Theorem if 
and only if (T6) can tile Z x Z (with the same origin constraint). 


Consequently, 


Conclusion. The tileset (T6) is solvable (with the origin constraint from Theorem b.4.15) if and 
only if P(x) does not terminate. Moreover, the whole procedure transforming (P, x) into (T6) is 
computable. Since the negation of a boolean is obviously computable, we have a program that 
translates the Halting problem into the Domino problem with fixed origin. If the latter were decidable, 
then by our translation the former would be, and we would have a contradiction. Therefore, the 
Domino problem with fixed origin is undecidable. 
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tt 
Computation by (15), going upwards 


(0, 0) 


Computation by (7'5’), going downwards 
4 


Figure 2.10: This symmetric tiling does the computation of P(#) twice: once upwards and once 
downwards. 
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2.e Exercises for Lecture 2 


Here are all the exercises from the lecture notes, reorganized and renumbered to make an exercise 
sheet. There are also a few new questions, to keep things interesting. Solve only what you like. 


Exercise 2.e.1. (Lineland) We consider the one-dimensional version of the Domino problem: a 
Wang bar is a unit segment with colored vertices, and a finite set of Wang bars is a barset. Given a 
barset B, a one-dimentional tiling of a set of points S C Z by B is a function S — B. A tiling of 
the whole Z is called a tiling of the line. Of course Wang bars also have color constraints: a tiling is 
valid if for each pair of neighbouring bars, the adjacent vertices have the same color. For example, 
these two bars match: mg— mam——1m. 

Here is an example of a valid tiling of the line: 


A barset is solvable if it has at least one valid tiling of the line. 
A tiling of the line T' is periodic if there exists a nonzero integer p, called the period, such that 
T(x +p) = T(x) for all x in Z. 


(a) Prove the following statement (called the one-dimensional Compactness theorem). Let A denote 
a barset; if there is a valid tiling of the set (—n,...,--n) by A for every integer n, then there is 
a valid tiling of Z by A. 


(b) Show that Wang's conjecture holds for one-dimensional tilings: if a barset is solvable, then it 
has at least one periodic tiling of the line. 


(c) Give an algorithm solving the one-dimensional Domino Problem. 


Back to Flatland... Only Exercise talks about Wang bars. All other exercises talk about 


two-dimensional tiles. 


Exercise 2.e.2 (S. Let A denote a solvable tileset. Prove that if A has a valid tiling T' with one 
period Y, then it has a tiling (maybe different from T) with two periods ?; and va which are not 
colinear (proportional). 


Exercise 2.e.3. Design a set of 3 Wang tiles A such that all tilings of the plane by A, up to 
translation, look like this: 


where squares of different colors denote Wang tiles of different types. 
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Exercise 2.e.4. 


(a) Design a set of 9 Wang tiles A such that one of its tilings of the plane looks like this: 


where squares of different colors denote Wang tiles of different types. (One color can correspond 
to several tiles, but one tile corresponds to only one color on the picture). 
(If necessary, start by designing a tileset with any number of tiles inside, and then try to reduce it.) 


(b) Explain why the tileset from your solution of (a) also can assemble in tilings where all the tiles 
are “orange” (respectively “blue”, “violet”, “yellow”). 


Exercise 2.e.5. Find an algorithm that, given a directed graph G, generates a set of Wang tiles 
A such that G has a cycle if and only if A is solvable. The algorithm should work with a constant 
memory. The algorithm itself should not answer whether the graph has a cycle or not, it 
only should translate to a tileset. 


Exercise 2.e.6. Why is the condition “has at least one cell with H on its south color” needed in 
Lemma of the lecture notes? 


Exercise 2.e.7 (S. Let us modify the definition of a Turing machine and of a configuration (81.2.4) 
so that the memory is biinfinite, i.e., a function Z — I instead of N > I. Prove that a function is 
computable with an N- Turing machine if and only if it is computable with a Z- Turing machine. 


Exercise 2.e.8 ($3. Write a Python program that converts a Turing machine into a set of tiles, as 
described in Section 2.4. 


Contacts 


e Daria Pchelina (dpchelina@clipper.ens.fr) 
e Guilhem Gamard (guilhem.gamard@normale.fr) 
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Lecture 3 


The Robinson tileset 


3.1 Introduction 


3.1.1. In the last lecture, we proved that the Domino Problem with fixed origin is undecidable. This 
is a good step, but we still have to get rid of the constraint on the origin to prove that the original 
Domino Problem is undecidable. 

To do so, let's discuss the Robinson tileset, that we will denote by R. It was initially published 
by Raphael Robinson in 1971 t9]. The tileset R is aperiodic, meaning that it is solvable (it can tile 
the plane), but no valid tiling of the plane by R is periodic. Moreover, it contains relatively few tiles, 
which makes its description easy. 

Once R is well-understood, we show how to exploit it to fully prove that the Domino Problem is 
undecidable. (That proof can be found in the paper by Robinson, il.) 


3.1.2. A representation of R is displayed on Figure bil but it is not exactly a set of Wang tiles. 


e First, by contrast with the tilesets we have seen so far, Robinson tiles can be rotated, so we 
have to make 4 copies of each Robinson tile (one copy per possible orientation). 


e Second, Robinson tiles do not have colors on their borders, but rather colored arrows coming 
in and out. The rule is that two tiles can be neighbours if and only if each arrow going out of 
the first tile is prolonged by an arrow coming into the second tile. The prolonging arrow must 
have the same color as the initial one, and the endpoints should exactly match. Figure B 
shows an illustration of valid pairs of tiles. 


e Finally, observe that one tile (four if we count rotations) has small yellow dots in its corners. 
The rule is that whenever four tiles share one common corner, exactly one of them should have 
those yellow dots. Figure illustrates that rule. 


Formally, that last rule is not a “color constraint" in the sense of Wang tiles. However, it is not hard 
to design a set of Wang tiles that enforces that rule (see Exercise bo. A piece of a Robinson tiling 
satisfying those constraints is shown in Figure 9.4. 


Figure 3.1: Robinson tiles, up to rotation. 
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Figure 3.2: Examples of adjacent tiles. Figure 3.3: Four tiles sharing a corner. 
aes ae ch ce aa aa A O B SE a a a NE ID Il] ae on so ee fart 
s E BÉ; Ya T TE HE + Hr E Fe a nF: [T HE EE eter 
F3 F4 4" zy AA a a T mt ATA y + iz’ Ipp i 
AE a [Her ir a ec p pel o deem 
| 1 R (A 890100 0 0 004 ER Ls AS Sener SP 1 
Cx FEMME AT imi [ES E. MET T 
ptt bt H et hte tep a NA D E iA A O O AR te te 
H I-RO—H A E rg tt A H 
arc qc cc mn n | rar IE LET Pepe eee ee ap ps H 
| | | HH 
EE a AEEA EEEE IE tia e 
A a a A JP A o A LU Um URBEM em nom m n 
FR a E $ E E E S F9 r8 PA SES T AA + 
rek MS e o T MÍ [T HH [| Dee P Te TH nam 
ta Be Se eo = ee G Sa a ODER 
ny EI by Herat bh td hy g Va bY a B Ya ht | hg ht | bh ta hy h Va 
ri te —4^- nM q >| p" T “O, 4 ig l| >} Ii" 
wt + [A on een A SAA DA SD A 
Maa as a as rs a > 


Figure 3.4: A piece of a Robinson tiling. 


3.2 Tiling in the large: macrotiles 


3.2.1. Let A and B denote two tilesets. A projection of A on B is simply a function f : A > B 
that preserves color constraints. In other terms, if a; and a» are allowed to be next to each other, 
then f(a¡) and f(az) also are. If T is a tiling by A, then f(T) denotes the tiling by B obtained by 
applying f to each tile of T. The definition of a projection implies that if T is valid, then f(T) is 
also valid. 

A projection f : A > B is: 


e injective if two different tiles are always given two different images: Va1,a9 € A, a4, Has => 


f(a) 4 f (az); 


e surjective if any tile of B has at least one preimage by f: Vb € B, Ja € A, f(a) = b; 


e bijective if it is both injective and surjective. 


If f : A > B is an injective (respectively surjective, bijective) projection, then its extension to 
tilings f : A” >) B® is also injective (respectively surjective, bijective). Therefore, for all our work, 
it does not hurt to treat a projection between tilesets and its extension to tilings as the same object. 


Exercise 3.2.2. Design a set of Wang tiles A along with a surjective projection from A to R. 


3.2.3. Let A denote an arbitrary tileset. A macrotile over A is just a valid tiling of a square (say, of 
size n x n) by A. If M = (m,,..., my) is a set of macrotiles over A, all of the same size, then we 
can consider M as a tileset on its own. Indeed, we can check as usual if two macrotiles are allowed 
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Figure 3.5: Checking if two macrotiles may be neighbours. 


to be neighbours: all the colors along the edges in contact should match (see Figure B.5). In a sense, 
the colors on the borders of macrotiles are cartesian products of colors on the base tiles. 


3.2.4. Suppose we have a tileset A and a set M of macrotiles over A, all of the same size. We say 
that M is a macrotileset over A. Let f denote a bijective projection of A on M. This projection 
maps each tile a in A to a macrotile m in M, while preserving the validity of the color constraints. 
But since m is just a square tiled by A, we can again apply f to it: f(m) is a macromacrotile over 
A. Consider the set of all macromacrotiles: 


f(M) = (f(m)|m € Mj. 


Each macromacrotile is still a (very large!) square tiled by A, and the colors constraints are again 
preserved. Thus nothing stops us from defining f?(M), and f*(M), and so on. The satisfaction of 
color constraints is preserved at every step. Figure 3.6 gives an illustration of this process. 


Lemma 3.2.5. Let A be a tileset and M be a macrotileset over A. If there exists a bijective projection 
of A to M, then A is solvable. 


Proof. Call f the bijective projection and k the integer such that the elements of M have size k x k. 
Take a any tile in A, and observe that the macrotile f”(a) is well defined for every n. In particular, 
f" (a) is a valid tiling of a k” x k” square. Then the Compactness Theorem (8.2.4 implies that A 
is solvable. 


3.3 Robinson’s hierarchy of macrotiles 


3.3.1. Now we prove that R is solvable. We will define four sequences of macrotiles, called (an)nen, 
(dn)nen, (Cn)nen, and (dn)nen. First, ao, do, co and dy are just the four tiles of R with yellow dots in 
the corners. Recall that those four tiles are identical up to rotation; Figure shows how we assign 
the names to them. By the way, we define ap, bo, G and dy to be the versions of ao, bo, Co and do 
without the yellow dots in the corners (those tiles also belong to R, as you can check on Figure B.1)); 
we will use them a little later. 

In what follows, bn, c, and dn will always be rotations of an, with the same orientations as in 
Figure B.7, Thus it remains to define a, for n > 2. 


3.3.2. Let n denote some integer and suppose that an, bn, Cn and d, are defined. Then we define 
an (that’s a Greek a!) as follows: arrange one copy of an, one copy of bn, one copy of c, and one 
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color match color match color match 
f PET: 0 eee f PEREA A O IETS EEE E AET 
t —- f(t’) — Pe) 


Figure 3.6: Iterating a projection of tiles on macrotiles 


j Eg 
fi] a 


Figure 3.7: Robinson’s macrotiles ag, bo, Co and do. 
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copy of d,, in a square, leaving a gap of one cell between them, so that the gap has the shape of a 
cross. Then, add one copy of ay in the center of that cross. (See Figure 8.8.) The pattern a, is not 
quite a macrotile, because a macrotile has to have the shape of a square. Fortunately, there exists 
an unique way to complete a,, into a valid tiled square — we will prove this shortly. Then, a,, is 
just that unique completion o;, into a valid square. The macrotiles b,,, Cn and d, are defined to be 
rotations of an; since all tiles in R exist in all their possible rotations, they are also valid tilings of 
squares. 


An b, 


Cn dn 


Figure 3.8: The pattern o: a partial macrotile for a,. Its completion into a valid square is unique. 


Lemma 3.3.3. For each n > 1: 


(i) there is an unique completion of ay, into a valid square, which we call a +1; 
(ii) an is of size (2"*! — 1) x (27+1 — 1); 
(iii) an has one red arrow going out on the middle of its right side and on the middle of its bottom 
side; 
(iv) everywhere else on the borders of an, green arrows are going out; 
(v) the four tiles in the corners of an have yellow dots. 


Moreover, bn, Cn, and d, also have properties (ii)-(v), except that the directions mentioned in (iii) 
should be adapted to the rotation that defined bn, Cn and dy. 


Proof. Initialization. The base case, ao, happens to be a single tile, so all the statements of the 
lemma except the first one are tautological. 


(i) Figure shows o; a direct observation (try all possibilities) shows that there is only one 
possible completion. Figure shows the completed macrotile a4 (on the right), as well as an 
alternative intuition about how this macrotile was constructed. Let b1, c4 and dı be the rotated 
versions of a4. 


The tile ao is of size 1 x 1 = (2! — 1) x (2! — 1). 
The tile ag has red arrows pointing out of its right side and from its bottom side. 
The tile ag has green arrows pointing out from its left size and its top sides. 


The tile ag has yellow dots. 
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It is immediate to check that the tiles by, cy and dy have the same properties as ay, up to changing 
the directions of the red arrows according to the rotation. 


Induction. Now, suppose that all the properties of the lemma hold for some integer n. Figure B.11] 
shows a, again, with some information about a, that we know by recurrence hypothesis. We call 
gaps the four “corridors” that are bordered with green and red arrows pointing inwards. 


(i) The red and green arrows pointing inside the gaps (that come from the recurrence hypothesis 
on An, bn, Cn and dn), as well as the copy of ag in the middle, leave only one possible completion. 
The right gap and the bottom gap have outgoing red arrows, that are “crossed” by the joining 
green and red arrows from bn, dn and cn. The top gap and the left gap have outgoing green 
arrows, similarly “crossed” by joining green and red arrows from bn, an and Cp. 


(ii) By induction hypothesis, a, has border size 2"*! — 1, so an +1 has, by construction, border size 
2 x (25 —1) 12 2222 — 1. 


(iii) The unique completion of œn, which is a, ,1, has a red arrow pointing out at the right and at 
the bottom of its “gaps”. 


(iv) All the borders of an, bn, c, and d, that face the outside of anı have green arrows pointing 
out by recurrence hypothesis. Moreover, the only possible completion of a, has green arrows 
pointing out of the top and left “gaps”. 


(v) The tile in top left-hand corner of a, has yellow dots by recurrence hypothesis, and it is also 
the tile in top left-hand corner of a,,,,. Similarly, the bottom left-hand corner of b, has yellow 
dots (by induction hypothesis) and it is also the bottom left-hand corner of a,+1. A similar 
reasoning on cn and d, shows that the other corners of a, +1 also have yellow dots. 


We defined 5,44, Cn+1 and d,,,1 to be the rotations of an+ı. Thus all the properties on a, 1 imply 
the same properties on 54,1, c, 41 and d, +1, except that the orientation of the red and green arrows 
are adapted according to the rotations. 
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Figure 3.9: The pattern ao: a partial macrotile for a;. 


3.3.4. For any integer n, the function f : {an, bn, Cn, dn} > R mapping a, to aj, bn to bi, Cn to c4 
and d,, to dı is an injective projection. However, it is not surjective: the tile ag, for instance, has no 
preimage. This means that we can use this projection to “collapse” Robinson squares and somehow 
“move down” in the hierarchy, but we cannot use it to expand single tiles into Robinson squares and 
thus to “move up” in the hierarchy. 


40 


Q2 


Figure 3.10: Detailed transformation of a macro-tile of level 0 in a macrotile of level 1. 
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Figure 3.11: The pattern an, with additional information from Lemma b.3.3} 
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3.4 Robinson’s tileset is solvable 


Now we prove that Robinson’s tileset is solvable. 
Theorem 3.4.1. Robinson’s tileset is solvable. 


Proof. Lemma states that, for each n, the macrotile a, is a valid tiling by R of a square of size 
(2” — 1) x (2” — 1). The result follows by the Compactness Theorem "Y 


Theorem 3.4.2. For any tiling T of the plane by R and for any integer n, one of the macrotiles an, 
bn, Cn or dn occurs somewhere in T. 


Proof. First, one of a1, b1, c1 or dı should occurr in any tiling because of the rule about yellow dots. 
By recurrence, we must show that any occurrence of an, bn, Cn or dn is part of an occurrence of an+1, 
bn41; Cn41, Or dn41. (Note that there is no correspondance here: an occurrence of a, could be a part 
of an occurrence of c, +1, for instance.) This proof is similar for an, bn, Cn and dn, so it can be done 
for a,, only. 

The rest of the argument is just a long case analysis, without any surprise. It is thus left as an 
exercise in this version of the notes. The slides accompanying Lecture 3 (which could unfortunately 
not be displayed properly on the D-Day!) show where to look. 


Exercise 3.4.3. Prove that Robinson's tileset is aperiodic: no tiling of the plane by R has a period. 


Solution. Suppose there exists an R-tiling T' with a period p. Then, the translation of T' by p 
must send every occurrence of a, (respectively, bn, Cn, dn) on an occurrence of a, (respectively, bn, 
Cn, dn). We will prove in a moment that in all R-tilings, each tile belongs at most one occurrence of 
an Or bn or Cn or dn. This means, for instance, that no tile can belong to an occurrence of a, and an 
occurrence of b, at the same time, nor to two different occurrences of a, at the same time. In other 
terms, two occurrences of an/bn/Cn/dn never intersect. 

If we accept that this fact is true, then the translation of T' by the periodicity vector p should 
send each occurrence of a, (respectively bn, Cn, dn) to a disjoint occurrence of an (respectively bn, 
Cn, dn), so p should have length at least 2"*! — 1 (the size of a, and his friends). But since we have 
occurrences of a, for arbitrarily large n, the vector p should have a length greater than 2"*! — 1 for 
arbitrarily large n, which is not possible: we have a contradiction. 

Thus it remains to prove that: 


Lemma 3.4.4. In any R-tiling T, two macrotiles of the same level never intersect. In other terms, 
no tile of T belong at the same time to an occurrence of a, and to an occurrence of bn, Cn, dn, nor 
to another occurrence of an. 


Proof. The proof is by recurrence. For n = 1, the objects a4, b1, c; and dı are single tiles, so 
they cannot intersect. Suppose that for some integer n, the macrotiles an, bn, Cn and d, do not 
intersect. Consider an+ı, which is the completion of the pattern displayed on Figure bd A simple 
geometrical argument shows that if an occurrence of bn+1, Cn+1 or dn+1 intersected with a, +1, then we 
would necessarily have two occurrences of objects (a4, bn, Cn, dn} intersecting as well. By recurrence 
hypothesis, this doesn't happen. So occurrences of objects (a441, 0441, C441, dn+1} do not intersect 
either. 


End of solution. 
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3.5 The Domino Problem is undecidable (at last!) 


3.5.1. We have almost all the ingredients for the proof that the Domino problem is undecidable. 
We will proceed as in Section ba start from (R), and build gradually more complicated tilesets 
(R1), (R2), etc. At some point, we will take an arbitrary Turing machine and embed it in one of 
those tilesets, in the same way as we did in Section 2.4, but without requiring the constraint at the 
origin. But first, we need to modify (R) a little bit independently of any Turing machine, just to 
make things easier later. 


3.5.2. Let us start to describe (R1). Instead of having just red arrows, we want to have even red 
arrows (represented by dotted lines) and odd red arrows (represented by solid lines). They are defined 
as follows. Recall that, in a Robinson tiling, each red arrow is part of a red square, which is the 
border of a macrotile of size (2"*! — 1) x (2"*! — 1). If n is even, then the considered red arrow is 
an even red arrow, and if n is odd, then the considered red arrow is odd. See Figure 8.13 for an 
illustration what a tiling by (R1) looks like. 


The tileset ( R1), displayed on Figure 5.13 enforces the desired shape in its tilings. This set (R1) 
have been built from (R) with the following algorithm. For each tile r in (R): 
e ifr has one red arrow, then copy it twice: one with an even arrow and one with an odd arrow; 


e if r has two crossing red arrows, then copy r twice: once with the horizontal arrow even and 
the vertical arrow odd; and once with the horizontal arrow odd and the vertical arrow even; 


e ifr has no red arrows at all, just keep it as-is in (R1). 
The idea is that whenever two red arrows are crossing, one of them must belong to a red square of 
size 2" — 1 while the other one must belong to a red square of size 2"*! — 1. Therefore, they cannot 
have the same parity. Thus the tiling (R1) is built so that the parities of successive squares are 


alternating. Figure B.13 shows four squares of size 3 (in 1-macrotiles, thus, odd), one square of size 
5 (in a 2-macrotile, thus, even) and a piece of a square of size 7 (in a 3-macrotile, thus, odd). 
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Figure 3.12: Extended Robinson tileset, ( R1), with even and odd red arrows. Rotated tiles are not 
shown. 


3.5.3. From now on, even though they do exist, we will not draw yellow dots, green arrows, nor dotted 
red arrows anymore on our figures. We focus solely on red squares made of solid lines, understanding 
that those square must be organized as on Figure because of the local constraints imposed by 
not-drawn arrows and dots. 


3.5.4. We build (R2) by slightly transforming each tile of (R1): we move the solid red lines very 
close to their opposite border of the tile, in such a way that the red squares in tilings become very 
close to the border of the internal square of tiles they delimit. The resulting tileset, (R2), is displayed 
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Figure 3.13: The macrotile a4 in (R1). Different Figure 3.14: A tiling by (R1): red squares with 
shades of red are used to show the macrotiles of even (dotted) and odd (solid) red arrows. Green 
level 2, 3 and 4. arrows and yellow dots are not displayed. 


on Figure 5.15. Moreover, Figure 8.17) shows a tileset by (R2) (without the green arrows, dotted 
red lines nor yellow dots) and Figure B.18 shows how a single red square is tiled by (2). 

Note that there is a bijective projection between (R1) and (R2); in other terms, the difference 
between (R1) and (R2) is purely graphical, it does not change the local rules at all. However, (R2) 
will help to make our figures easier to read in the next paragraphs. 


PD) ed EH ES tH dh ml 


Figure 3.15: The tileset (R2) is a modified version of (R1), where the red arrows are closer to the 
border. 


3.5.5. In (R2), we call: 
e top tile any tile having a red line near its top border; 
e bottom tile any tile having a red line near its bottom border; 
e left tile any tile having a red line near its left border; 
e right tile any tile having a red line in its right border. 


Be careful: by construction of (R2), the bottom border of a square is actually made of top tiles, and 
the top border of a square is made of bottom tiles! Similarly, the right border of each square is made 
of left tiles, and the left border of each square is made of right tiles. In other words, the red squares 
tend to be “inside” the tiles that border them. Please see Figure to make this picture clearer. 
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Figure 3.16: Red squares of odd level tiled by — Figure 3.17: Red squares of odd level tiled by 
(R1). (R2). 


+ bottom tiles (red line in the bottom) 


+ top tiles (red line in the top) 


Figure 3.18: The top border of a square is made of bottom tiles, and vice-versa. 
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3.5.6. Consider again Figure B.17, We say that each tile belongs to a red square, namely the 
smallest square that surrounds that tile. For instance, in Figure big only the central tile belongs 
to the visible red square. Or, in Figure (don’t look at all the details), all the dotted tiles belong 
to squares of size 1 x 1, while yellow tiles belong to the square of size 7 x 7. (The tiles with arrows 
also belong to this square, we will come to their meaning in a moment.) 

Let C denote any red square; some tiles inside C belong to C', and some tiles do not: they belong 
to smaller squares that are contained in C. We say that a tile at position (x,y) is pure in C if it 
belongs to C, and if all tiles on the same line and on the same column as (x,y) in C also belong to 
C. For instance if C is a 7 x 7-square of Figure B.19) then the yellow tiles are pure in C, and the 
tiles marked with double arrows (<> and f) belong to C but are not pure. 

In a square C, the tiles that belong to C but that are not pure are called unpure. There are 
two kinds of unpure tiles: the vertical unpure tiles have a non-belonging tile on the same column, 
while the horiontal unpure tiles have a non-belonging tile on the same row. Each unpure tile is either 
vertical unpure, horizontal unpure, or both at the same time. On Figure 5.19. vertical unpure tiles 
are marked with horizontal arrows (<>) while horizontal unpure tiles are marked with vertical arrows 
($). The reason why the arrows seem reversed will become clear in a few paragraphs. Note that 
each tile belong to a square and is either pure, vertical unpure or horizontal unpure (or both kinds 
of unpure at the same time) in that square. 


Figure 3.19: Yellow cells denote pure cells, arrows denote vertical unpure and horizontal unpure cells, 
dotted cells are inside small red squares. 


3.5.7. The tileset (R3) is a copy of (R2) where each tile “knows” whether it is pure, vertical unpure 
or horizontal unpure. Formally, the set of colors of (R3) is C = C((R2)) x {P, Vi, Vo, Hi, H2}, where 
C((R2)) is the set of colors of (R2), and P, Yi; and Hq} respectively mean “pure”, “vertical 
unpure" and “horizontal unpure”. To build (23), we follow these rules. 


e For each tile with a red line near the left, make all copies with (?4;, P} on the right and 
H5, Ho} on the left. 


e For each tile with a red line near the left, make all copies with {H2, Hı} on the right and 
(P, Hı} on the left. 


e For each other tile, make all copies of that tile with the same element of (P, H1, H2} on the 
left and on the right. 


(The rules to treat Y; 2 and vertical impurity are exactly symmetric.) To understand these rules, let 
us scan a big square (Figure 8.21) from left to right. The red line on the left is a right tile; the next 
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Figure 3.20: Two levels of hiararchy: yellow filling and arrows denote computational cells of the 1st 
level; violet filling and arrows—of the 2nd level. 


red line that we find may either be a left line, which means that we found the end of the big square 
and thus all the tiles in between were pure. The next red line we find may also be a right line, which 
means that we found the beginning of a smaller square. In this case, all the tiles in between are 
horizontally impure. The situation is symmetrical if we start from the right of the square. 

Finally, the tiles between two small squares are also vertically impure. Since the rule to detect 
them is a bit different, we use a separate color, Hə. The idea is that a tile to the right of a left tile 
is necesarily impure. 


Figure 3.21: How to locate horizontal unpure tiles. 


The result is a tileset (R3) that has a surjective projection to (R2), but which is partitioned in 
three subtilesets: the subset of pure tiles ( on every side), the subset of vertical unpure tiles (Y, or 
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Və on either horizontal side), and the subset of horizontal unpure tiles (Hı or Ha on either vertical 
side). 


3.5.8. Let P denote a Turing machine and U the tileset that forces to draw a space-time diagram 
of P(#) (this is tileset (75) in the Section b.4). All we have to do now is to merge U and (R3) into 
a tileset V, such that: 


e the computation of P(#) is only done on pure tiles; 


e the vertical (respectively horizontal, both kinds) unpure tiles transmit the information about 
the computation horizontally (respectively vertically, both directions); 


e the bottom, left-hand corner of each read square contains the head of a Turing machine. 


It is not hard to take the cartesian product of U and (R3), and then to remove the appropriate tiles 
to get the desired properties. 

Consequently, in any tiling of the plane by V, each red square contains a space-time diagram of 
P(#), whose size depends on the size of the square. If the computation is not finished when the end 
of the square is reached, it is simply interrupted. This is not a problem because we have arbitrarily 
large squares, so another, bigger square will exist somewhere and give more time to the computation. 


Theorem 3.5.9. The tileset V tiles the plane if and only if P(#) does not halt. Moreover, the 
transformation from P to V is computable. 


Proof. By Theorem "E any tiling of the plane by (R3), and thus by V, contains arbitrarily large 
red squares. A simple recurrence shows that the number of pure tiles inside a red square is strictly 
increasing with the size of the square. Therefore, any tiling of the plane by V has to contain arbitrarily 
large space-time diagrams of P(#). This is possible if and only if P(#) does not halt. 

To see that the transformation between P and V is computable, observe that the tileset (R3) is 
finite and known in advance. The tileset U can be produced from P using the algorithm from the 
previous chapter. Finally, V is a subset of the product of U and (R3), and this subset is easily seen 
to be computable. 
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3.e Exercises for Lecture 3 


Here are all the exercises from the lecture notes, reorganized and renumbered to make an exercise 
sheet. There are also a few new questions, to keep things interesting. Solve only what you like. 


Robinson tileset 


JIEHBEBIBHIFRBR 


Exercise 3.e.1. (a) Prove that there are infinitely many Robinson tilings of the plane. 


(b) Prove that there are uncountably many Robinson tilings of the plane. 
(c) Prove that no Robinson tiling has a period: thus, the Robinson tileset is aperiodic. 


(d) Show that there exists a Robinson tiling with a right-infinite red arrow: 


f 7 


(e) Show that there exists a Robinson tiling with a bi-infinite red arrow: 


Exercise 3.e.2. An emulation of R by Wang tiles is a pair (W, f) where W is a Wang tileset and f 


is a map W > R such that for any valid tiling T by W, the tiling f(T) by R is also valid. (Apply f 
on each tile of T to get f(T).) Design an emulation of R by Wang tiles, having: 


(a) < 64 tiles; 


(b) 56 tiles. 


Hierarchy 

If X and Y are two rectangles of tiles with the same height, then we define Das X DY := X | Y |. 
X 

If X and Y are two rectangles of tiles with the same width, then we define O as: X O Y := YI 


Given a tileset A, an (n x m)-substitution is a function s : A + A"*™ which maps each tile to a valid 
block of size n x m, so that for all tiles x, y in A: 


e rOyis valid if and only if s(x) O s(y) is valid; 


e xOy is valid if and only if s(x) O s(y) is valid. 


Exercise 3.e.3. Consider the tileset T = B. Xl. P. b: 


(a) find an (1 x 3)-subsitution for T; 


(b) find an (n x 3)-substitution for T, for all n > 2. 
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Exercise 3.e.4. Let A be a tileset, s be a substitution s: A > A”*” and T be a tiling of the plane 
by A. We say that T is a substitution tiling for s if and only if, for any block C appearing in T, there 
exists an integer n and a tile a such that C appears in s"(a). 


(a) Design a substitution tiling for the (2 x 3)-substitution from Exercise bed 
(b) Prove that there exist no substitution tiling for the (1 x 3)-substitution from Exercise Bed 


(c) Prove that, for any tileset A and any substitution s : A — A!*", there exist no substitution 
tiling for A and s. Same question if s is s: A — A"*!, 


Exercise 3.e.5. Let A denote a tileset, s denote a substitution and T denote a substitution tiling for 
A and s. We say that T has an unique derivation if there is exactly one tiling U such that T = s(U). 


(a) Show that the tiling you found for Exercise B.ed(a) doesn't have unique derivation. 
(b) Show that there is no tiling with unique derivation for the (2 x 3)-substitution from Exercise be3 


(c) Let To denote a substitution tiling with substitution s. Prove that if there is an unique infinite 
sequence of tilings 71, T5, T3,... such that: 


T = s(Tn+1) 


for all n in N (in particular, each T, has an unique derivation), then To is aperiodic. 


Contacts 


e Daria Pchelina (dpchelina@clipper.ens.fr) 
e Guilhem Gamard (guilhem.gamard@normale.fr) 


50 


Appendix A 


Bibliography 


Simon Marlow et al. The Haskell 2010 Language Report. Self-published, 2010. URL: 


/www.haskell.org/onlinereport/haske112010/. 


Robert Berger. “The undecidability of the domino problem”. In: Memoirs of the American 
Mathematical Society 66 (1966). His PhD thesis, with the same title, was defended in 1964 at 
Harvard university; it contains pretty much the same text. 


Leo Brodie. Starting forth. Prentice-Hall, 1981. URL: https://www.forth.com/starting- 
toro 


Emmanuel Jeandel and Michaël Rao. “An aperiodic tileset of 11 Wang tiles”. In: Arziv 1506.06492 
(2015). URL: https: //arxiv.org/abs/1506.06492. 
Pypy. URL: https: //pypy .org. 


Raphael M. Robinson. “Undecidability and Nonperiodicity for Tilings of the Plane”. In: /nven- 
tiones Mathematicae 12.3 (1971). 


Michael Sperber et al. Revised? Report on the Algorithmic Language Scheme. Cambridge Uni- 
versity Press, 2007. URL: http://www .r6rs.org/. 


Hao Wang. ^Dominoes and the AEA case of the decision problem". In: Computation, Logic, 
Philosophy: A Collection of Essays. Mathematics and its Applications (China Series). Springer, 
1989. 


Hao Wang. “Proving theorems by pattern recognition—IT”. In: Bell System Technical Journal 
40.1 (1961), pp. 1-41. 


51 


Appendix B 


List of figures 


1.1 The memory (filled with random numbers) and the head]. . .. .. .. .. ...... 5 


6 
8 
8 
2.1 A tileset and a (part of) a valid tiling by this tileset} .................. 2] 


2.2 A simple tiling. .. .. ....... llle s 22 


2.3 A sequence of elements of Z? that covers Z2]... 222. 23 
2.4 Arepeatablesquare]| . . .. .... ll ss 24 
D5. A line tiled by (TA) ideo oe BE we AR a 27 


26. A me leds by (2) e vs ho eh ee GK AR KR A ee A A E 27 
2.7 "Two consecutive lines tiled b for v — R. 


2.8 Two consecutive lines tiled by (TA) for v = LJ] ........ a a 30 


2.9 Transformation of a tile of (T5) into a tile of (T5) ............. nn 31 


2.10 This symmetric tiling does the computation of P(#) twice}... ............ 32 
3.1 Robinson tiles, up to rotation)... . .. ls 35 
3.2 Examples of adjacent tiles... 36 


3.3 Four tiles sharing a tome... 36 


3.4 A piece of a Robinson tiling] . . . .. .. ss 36 


3.5 Checking if two macrotiles may be neighbours/...................... 3T 
3.6 Iterating a projection of tiles on macrotiles| . . . .. ..... en 38 
8.7 Robinson's macrotiles ag, bo, cy and la 38 


3.8 The pattern o4: partial macrotilefor an. 39 
3.9 The pattern ag: a partial macrotile fora;| ....0.00000000000020200000042. 40 


52 


3.13 The macrotile a4 in (R1). Different shades of red are used to show the macrotiles of 


evel 2,3 A A .. .. ll ll e e e e sS ERR s 44 
3.14 A tiling by (R1): red squares with even (dotted) and odd (solid) red arrows] . . . .. 44 


3.15 The tileset (R2) is a modified version of ( R1), where the red arrows are closer to the 


3.16 Red squares of odd level tiled b 
3.17 Red squares of odd level tiled by CHI] . . i . au sound oa Yom ee A Oo 45 
3.18 The top border of a square is made of bottom tiles, and vice-versa} . . .. . ..... 45 


Ist level; violet filling and arrows—of the 2nd level] ..................... 47 
3.21 How to locate horizontal unpure tiles. 


53 


